ND0063 C1 L4 19 Lesson Recap Video

Lesson Recap

Monitoring
Alerting
Recovering
Automating

Lesson Objectives

You will be able to:

Monitor AWS applications
Alert on problems in applications
Recover failures in your platform
Understand testing and tradeoffs in automating recovery from failure

In this lesson, you learned how to monitor and maintain systems in AWS. You also looked at what and how to recover systems that have failed. The larger your application grows, the more parts and services it will have. The more complex it grows, the more things that can go wrong. The more things that can go wrong, the more frequently they will go wrong. Expect failures, and plan to address and recover from them.

Glossary

SSL certificate: Cryptographic certificate for encrypting traffic between two computers.
Source of truth: When data is stored in multiple places or ways, the "source of truth" is the one that is used when there is a discrepancy between the multiple sources.
Monitoring: Systems to track and make visible metrics that are useful in identifying system performance.
Alerting: Systems to attract attention when performance thresholds are crossed.
Chaos Engineering: Intentionally causing issues in order to validate that a system can respond appropriately to problems.

21. Lesson Recap

ND0063 C1 L4 19 Lesson Recap Video

Lesson Recap

Lesson Objectives

Glossary

Further Reading